20 resultados para Base Sequence

em Helda - Digital Repository of University of Helsinki


Relevância:

60.00% 60.00%

Publicador:

Resumo:

A repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N. Examples of such collections are version control data and genome sequences of individuals, where the differences can be expressed by lists of basic edit operations. Flexible and efficient data analysis on a such typically huge collection is plausible using suffix trees. However, suffix tree occupies O(N log N) bits, which very soon inhibits in-memory analyses. Recent advances in full-text self-indexing reduce the space of suffix tree to O(N log σ) bits, where σ is the alphabet size. In practice, the space reduction is more than 10-fold, for example on suffix tree of Human Genome. However, this reduction factor remains constant when more sequences are added to the collection. We develop a new family of self-indexes suited for the repetitive sequence collection setting. Their expected space requirement depends only on the length n of the base sequence and the number s of variations in its repeated copies. That is, the space reduction factor is no longer constant, but depends on N / n. We believe the structures developed in this work will provide a fundamental basis for storage and retrieval of individual genomes as they become available due to rapid progress in the sequencing technologies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Väitöskirjani käsittele mikrobien ja erilaisten kemikaalien rooleja saostumien ja biofilmien muodostumisessa paperi- ja kartonkikoneilla. "Saostuma" tässä työssä tarkoittaa kiinteän aineen kertymää konepinnoille tai rajapinnoille konekierroissa, jotka on tarkoitettu massasulppujen, lietteiden, vesien tai ilman kuljetukseen. Saostumasta tulee "biofilmi" silloin kun sen oleellinen rakennekomponentti on mikrobisolut tai niiden tuotteet. Väitöstyöni työhypoteesina oli, että i. tietämys saostumien koostumuksesta, sekä ii. niiden rakenteesta, biologisista, fysikaalis-kemiallisista ja teknisistä ominaisuuksista ohjaavat tutkijaa löytämään ympäristöä säästäviä keinoja estää epätoivottujen saostumien muodostus tai purkaa jo muodostuneita saostumia. Selvittääkseni saostumien koostumista ja rakennetta käytin monia erilaisia analytiikan työkaluja, kuten elektronimikroskopiaa, konfokaali-laser mikroskopiaa (CLSM), energiadispersiivistä röntgenanalyysiä (EDX), pyrolyysi kaasukromatografiaa yhdistettynä massaspektrometriaan (Py-GCMS), joninvaihtokromatografiaa, kaasukromatografiaa ja mikrobiologisia analyysejä. Osallistuin aktiivisesti innovatiivisen, valon takaisinsirontaan perustuvan sensorin kehittämistyöhön, käytettäväksi biofilmin kasvun mittaukseen suoraan koneen vesikierroista ja säiliöistä. Työni osoitti, että monet paperinvalmistuksessa käytetyistä kemikaaleista reagoivat keskenään tuottaen orgaanisia tahmakerroksia konekiertojen teräspinnoille. Löysin myös kerrostumia, jotka valomikroskooppisessa tarkastelussa oli tulkittu mikrobeiksi, mutta jotka elektronimikroskopia paljasti alunasta syntyneiksi, alumiinihydroksidiksi joka saostui pH:ssa 6,8 kiertokuitua käyttävän koneen viiravesistä. Monet paperintekijät käyttävät vieläkin alunaa kiinnitysaineena vaikka prosessiolot ovat muuttuneet happamista neutraaleiksi. Sitä pidetään paperitekijän "aspiriinina", mutta väitöstutkimukseni osoitti sen riskit. Löysin myös orgaanisia saostumia, joiden alkuperä oli aineiden, kuten pihkan, saippuoituminen (kalsium saippuat) niin että muodostui tahmankasvua ylläpitävä alusta monilla paperi- ja kartonkikoneilla. Näin solumuodoiltaan Deinococcus geothermalista muistuttavia bakteereita kasvamassa lujasti teräskoepalojen pintaan kiinnittyneinä pesäkkeinä, kun koepaloja upotettiin paperikoneiden vesikiertoihin. Nämä deinokokkimaiset pesäkkeet voivat toimia jalustana, tarttumisalustana muiden mikrobien massoille, joka selittäisi miksi saostumat yleisesti sisältävät deinokokkeja pienenä, muttei koskaan pääasiallisena rakenneosana. Kun paperikoneiden käyttämien vesien (raakavedet, lämminvesi, biologisesti puhdistettu jätevesi) laatua tutkitaan, mittausmenetelmällä on suuri merkitys. Koepalan upotusmenetelmällä todettu biofilmikasvu ja viljelmenetelmällä mitattu bakteerisaastuneisuus korreloivat toisiinsa huonosti etenkin silloin kun likaantumisessa oli mukana rihmamaiseti kasvavia bakteereja. Huoli ympäristöstä on pakottanut paperi- ja kartonkikoneiden vesikiertojen sulkemiseen. Vesien kierrätys ja prosessivesien uudelleenkäyttö nostavat prosessilämpötilaa ja lisäävät koneella kiertävien kolloidisten ja liuenneiden aineiden määriä. Tutkin kiertovesien pitoisuuksia kolmessa eriasteisesti suljetussa tehtaassa, joiden päästöt olivat 0 m3, 0,5 m3 ja 4 m3 jätevettä tuotetonnia kohden, perustuen puhdistetun jäteveden uudelleen käyttöön. Nollapäästöisellä tehtaalla kiertovesiin kertyi paljon orgaanisesti sidottua hiiltä (> 10 g L-1), etenkin haihtuvina happoina (maito-, etikka-, propioni- ja voi-). Myös sulfaatteja, klorideja, natriumia ja kalsiumia kertyi paljon, > 1 g L-1 kutakin. Pääosa (>40%) kaikista bakteereista oli 16S rRNA geenisekvenssianalyysien tulosten perusteella sukua, joskin etäistä (< 96%) ainoastaan Enterococcus cecorum bakteerille. 4 m3 päästävältä tehtaalta löytyi lisäksi Bacillus thermoamylovorans ja Bacillus coagulans. Tehtaiden saostumat sisälsivät arkkeja suurina pitoisuuksina, ≥ 108 g-1, mutta tunnistukseen riittävää sekvenssisamanlaisuutta löytyi vain yhteen arkkisukuun, Methanothrix. Tutkimustulokset osoittivat että tehtaan vesikiertojen sulkeminen vähensi rajusti mikrobiston monimuotoisuutta, muttei estänyt liuenneen aineen ja kiintoaineen mineralisoitumista.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

NMR spectroscopy enables the study of biomolecules from peptides and carbohydrates to proteins at atomic resolution. The technique uniquely allows for structure determination of molecules in solution-state. It also gives insights into dynamics and intermolecular interactions important for determining biological function. Detailed molecular information is entangled in the nuclear spin states. The information can be extracted by pulse sequences designed to measure the desired molecular parameters. Advancement of pulse sequence methodology therefore plays a key role in the development of biomolecular NMR spectroscopy. A range of novel pulse sequences for solution-state NMR spectroscopy are presented in this thesis. The pulse sequences are described in relation to the molecular information they provide. The pulse sequence experiments represent several advances in NMR spectroscopy with particular emphasis on applications for proteins. Some of the novel methods are focusing on methyl-containing amino acids which are pivotal for structure determination. Methyl-specific assignment schemes are introduced for increasing the size range of 13C,15N labeled proteins amenable to structure determination without resolving to more elaborate labeling schemes. Furthermore, cost-effective means are presented for monitoring amide and methyl correlations simultaneously. Residual dipolar couplings can be applied for structure refinement as well as for studying dynamics. Accurate methods for measuring residual dipolar couplings in small proteins are devised along with special techniques applicable when proteins require high pH or high temperature solvent conditions. Finally, a new technique is demonstrated to diminish strong-coupling induced artifacts in HMBC, a routine experiment for establishing long-range correlations in unlabeled molecules. The presented experiments facilitate structural studies of biomolecules by NMR spectroscopy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Schiff bases and their transition metal complexes are of significant current interest even though they have been prepared for decades. They have been used in various applications such as catalysis, corrosion protection, and molecular sensors. In this study, N-aryl Schiff base ketimine ligands as well as numerous new, differently substituted salen and salophen-type ligands and their cobalt(II), copper(II), iron(II), manganese(II), and nickel(II) complexes were synthesised. New solid state structures of the above compounds and the dioxygen coordination properties of cobalt(II) complexes and catalytic properties of three synthesised binuclear complexes were examined. The prepared complexes were applied in the formation of self-assembled layers on a polycrystalline gold surface and liquid-graphite interface. The effect of metal ion and ligand structure on the as-formed patterns was studied. When studying gold surfaces, a unique thiol-assisted dissolution of elemental gold was observed and a new thin gold foil preparation method was introduced. In the summary, synthesis, structures, and properties of Schiff base ligands and their transition metal complexes are described in detail and the applications of these reviewed. Assemblies of other complexes on a liquid-graphite interface and on a gold surface are also presented, and the surface characterisation methods and surfaces employed are described.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The analysis of sequential data is required in many diverse areas such as telecommunications, stock market analysis, and bioinformatics. A basic problem related to the analysis of sequential data is the sequence segmentation problem. A sequence segmentation is a partition of the sequence into a number of non-overlapping segments that cover all data points, such that each segment is as homogeneous as possible. This problem can be solved optimally using a standard dynamic programming algorithm. In the first part of the thesis, we present a new approximation algorithm for the sequence segmentation problem. This algorithm has smaller running time than the optimal dynamic programming algorithm, while it has bounded approximation ratio. The basic idea is to divide the input sequence into subsequences, solve the problem optimally in each subsequence, and then appropriately combine the solutions to the subproblems into one final solution. In the second part of the thesis, we study alternative segmentation models that are devised to better fit the data. More specifically, we focus on clustered segmentations and segmentations with rearrangements. While in the standard segmentation of a multidimensional sequence all dimensions share the same segment boundaries, in a clustered segmentation the multidimensional sequence is segmented in such a way that dimensions are allowed to form clusters. Each cluster of dimensions is then segmented separately. We formally define the problem of clustered segmentations and we experimentally show that segmenting sequences using this segmentation model, leads to solutions with smaller error for the same model cost. Segmentation with rearrangements is a novel variation to the segmentation problem: in addition to partitioning the sequence we also seek to apply a limited amount of reordering, so that the overall representation error is minimized. We formulate the problem of segmentation with rearrangements and we show that it is an NP-hard problem to solve or even to approximate. We devise effective algorithms for the proposed problem, combining ideas from dynamic programming and outlier detection algorithms in sequences. In the final part of the thesis, we discuss the problem of aggregating results of segmentation algorithms on the same set of data points. In this case, we are interested in producing a partitioning of the data that agrees as much as possible with the input partitions. We show that this problem can be solved optimally in polynomial time using dynamic programming. Furthermore, we show that not all data points are candidates for segment boundaries in the optimal solution.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes is to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented. In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs performed highly similar signals in some of the population genetic analyses when compared with the microsatellite markers. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies as SNPs can be applied with relatively highly degraded DNA.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Visual pigments of different animal species must have evolved at some stage to match the prevailing light environments, since all visual functions depend on their ability to absorb available photons and transduce the event into a reliable neural signal. There is a large literature on correlation between the light environment and spectral sensitivity between different fish species. However, little work has been done on evolutionary adaptation between separated populations within species. More generally, little is known about the rate of evolutionary adaptation to changing spectral environments. The objective of this thesis is to illuminate the constraints under which the evolutionary tuning of visual pigments works as evident in: scope, tempo, available molecular routes, and signal/noise trade-offs. Aquatic environments offer Nature s own laboratories for research on visual pigment properties, as naturally occurring light environments offer an enormous range of variation in both spectral composition and intensity. The present thesis focuses on the visual pigments that serve dim-light vision in two groups of model species, teleost fishes and mysid crustaceans. The geographical emphasis is in the brackish Baltic Sea area with its well-known postglacial isolation history and its aquatic fauna of both marine and fresh-water origin. The absorbance spectrum of the (single) dim-light visual pigment were recorded by microspectrophotometry (MSP) in single rods of 26 fish species and single rhabdoms of 8 opossum shrimp populations of the genus Mysis inhabiting marine, brackish or freshwater environments. Additionally, spectral sensitivity was determined from six Mysis populations by electroretinogram (ERG) recording. The rod opsin gene was sequenced in individuals of four allopatric populations of the sand goby (Pomatoschistus minutus). Rod opsins of two other goby species were investigated as outgroups for comparison. Rod absorbance spectra of the Baltic subspecies or populations of the primarily marine species herring (Clupea harengus membras), sand goby (P. minutus), and flounder (Platichthys flesus) were long-wavelength-shifted compared to their marine populations. The spectral shifts are consistent with adaptation for improved quantum catch (QC) as well as improved signal-to-noise ratio (SNR) of vision in the Baltic light environment. Since the chromophore of the pigment was pure A1 in all cases, this has apparently been achieved by evolutionary tuning of the opsin visual pigment. By contrast, no opsin-based differences were evident between lake and sea populations of species of fresh-water origin, which can tune their pigment by varying chromophore ratios. A more detailed analysis of differences in absorbance spectra and opsin sequence between and within populations was conducted using the sand goby as model species. Four allopatric populations from the Baltic Sea (B), Swedish west coast (S), English Channel (E), and Adriatic Sea (A) were examined. Rod absorbance spectra, characterized by the wavelength of maximum absorbance (λmax), differed between populations and correlated with differences in the spectral light transmission of the respective water bodies. The greatest λmax shift as well as the greatest opsin sequence difference was between the Baltic and the Adriatic populations. The significant within-population variation of the Baltic λmax values (506-511 nm) was analyzed on the level of individuals and was shown to correlate well with opsin sequence substitutions. The sequences of individuals with λmax at shorter wavelengths were identical to that of the Swedish population, whereas those with λmax at longer wavelengths additionally had substitution F261F/Y in the sixth transmembrane helix of the protein. This substitution (Y261) was also present in the Baltic common gobies and is known to redshift spectra. The tuning mechanism of the long-wavelength type Baltic sand gobies is assumed to be the co-expression of F261 and Y261 in all rods to produce ≈ 5 nm redshift. The polymorphism of the Baltic sand goby population possibly indicates ambiguous selection pressures in the Baltic Sea. The visual pigments of all lake populations of the opossum shrimp (Mysis relicta) were red-shifted by 25 nm compared with all Baltic Sea populations. This is calculated to confer a significant advantage in both QC and SNR in many humus-rich lakes with reddish water. Since only A2 chromophore was present, the differences obviously reflect evolutionary tuning of the visual protein, the opsin. The changes have occurred within the ca. 9000 years that the lakes have been isolated from the Sea after the most recent glaciation. At present, it seems that the mechanism explaining the spectral differences between lake and sea populations is not an amino acid substitution at any other conventional tuning site, but the mechanism is yet to be found.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The terrestrial export of dissolved organic matter (DOM) is associated with climate, vegetation and land use, and thus is under the influence of climatic variability and human interference with terrestrial ecosystems, their soils and hydrological cycles. The present study provides an assessment of spatial variation of DOM concentrations and export, and interactions between DOM, catchment characteristics, land use and climatic factors in boreal catchments. The influence of catchment characteristics, land use and climatic drivers on the concentrations and export of total organic carbon (TOC), total organic nitrogen (TON) and dissolved organic phosphorus (DOP) was estimated using stream water quality, forest inventory and climatic data from 42 Finnish pristine forested headwater catchments, and water quality monitoring, GIS land use, forest inventory and climatic data from the 36 main Finnish rivers (and their sub-catchments) flowing to the Baltic Sea. Moreover, the export of DOM in relation to land use along a European climatic gradient was studied using river water quality and land use data from four European areas. Additionally, the role of organic and minerogenic acidity in controlling pH levels in Finnish rivers and pristine streams was studied by measuring organic anion, sulphate (SO4) and base cation (Ca, Mg, K and Na) concentrations. In all study catchments, TOC was a major fraction of DOM, with much lower proportions of TON and DOP. Moreover, most of TOC and TON was in a dissolved form. The correlation between TOC and TON concentrations was strong and TOC concentrations explained 78% of the variation in TON concentrations in pristine headwater streams. In a subgroup of 20 headwater catchments with similar climatic conditions and low N deposition in eastern Finland, the proportion of peatlands in the catchment and the proportion of Norway spruce (Picea abies Karsten) of the tree stand had the strongest correlation with the TOC and TON concentrations and export. In Finnish river basins, TOC export increased with the increasing proportion of peatland in the catchment, whereas TON export increased with increasing extent of agricultural land. The highest DOP concentrations and export were recorded in river basins with a high extent of agricultural land and urban areas, reflecting the influence of human impact on DOP loads. However, the most important predictor for TOC, TON and DOP export in Finnish rivers was the proportion of upstream lakes in the catchment. The higher the upstream lake percentage, the lower the export indicating organic matter retention in lakes. Molar TOC:TON ratio decreased from headwater catchments covered by forests and peatlands to the large river basins with mixed land use, emphasising the effect of the land use gradient on the stoichiometry of rivers. This study also demonstrated that the land use of the catchments is related to both organic and minerogenic acidity in rivers and pristine headwater streams. Organic anion dominated in rivers and streams situated in northern Finland, reflecting the higher extent of peatlands in these areas, whereas SO4 dominated in southern Finland and on western coastal areas, where the extent of fertile areas, agricultural land, urban areas, acid sulphate soils, and sulphate deposition is highest. High TOC concentrations decreased pH values in the stream and river water, whereas no correlation between SO4 concentrations and pH was observed. This underlines the importance of organic acids in controlling pH levels in Finnish pristine headwater streams and main rivers. High SO4 concentrations were associated with high base cation concentrations and fertile areas, which buffered the effects of SO4 on pH.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Organocatalysis, the use of organic molecules as catalysts, is attracting increasing attention as one of the most modern and rapidly growing areas of organic chemistry, with countless research groups in both academia and the pharmaceutical industry around the world working on this subject. The literature review of this thesis mainly focuses on metal-free systems for hydrogen activation and organocatalytic reduction. Since these research topics are relatively new, the literature review also highlights the basic principles of the use of Lewis acid-Lewis base pairs, which do not react irreversibly with each other, as a trap for small molecules. The experimental section progresses from the first observation of the facile heterolytical cleavage of hydrogen gas by amines and B(C6F5)3 to highly active non-metal catalysts for both enantioselective and racemic hydrogenation of unsaturated nitrogen-containing compounds. Moreover, detailed studies of structure-reactivity relationships of these systems by X-ray, neutron diffraction, NMR methods and quantum chemical calculations were performed to gain further insight into the mechanism of hydrogen activation and hydrogenation by boron-nitrogen compounds.